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Abstract 


Some of the perturbation methods in classical Hamiltonian 
mechanics lead to near-identity transformations of the variables, 
•with the new variables explicitely given as functions of the old 
ones. Two methods are used for identifying and characterizing the 
sub-class of all such transformations which are also canonical; 
one approach is related to the conventional method of generating 
canonical transformations, while the other one uses the properties 
of Poisson brackets and is related to an operator method of Lie. 
Either of the methods may be used to derive certain steps in a 
perturbation method devised by Laeina, inadvertedly omitted by that 


author. 



INTRODUCTION 


Let a classical canonical system be given, described by 2n canonical variables 
collectively denoted by the vector y . The components y may be divided into 
canonical momenta p_^ and canonical coordinates q^ , and we shall assume that 
their order is 

y = (p x P n » ••• %) 

= ( p » q ) (x) 

A transformation y -e> z is termed canonical if the new variables also 
form a canonical set 

3 . ~ ( P » Q ) 

Canonical transformations are customarily defined by means of a generating 
function cT depending orL n new n 0 ;j.q variables (and possibly on 

time, though this will not be assumed in what follows). Of the several possible 
types of such functions, we shall choose to use those of the form 

O' = O' ( P . i ) (2) 

for which the transformation equations are given by 

n> <r/n> p. « ^ 

(3) 

'7)0r/^q i = P i 

These equations give the transformations indirectly: if the old variables are 
given and the new ones are sought, additional "untangling" is generally in order. 

By contrast, a "direct" transformation 
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z = z.(y) 

is immedi a tely usable. In particular, we shall be concerned, with near-identity 
transformations expanded in- terms of a small parameter £. 1 (bracketed 

superscript denotes order in £• )s 

2 = y + I 6 k ^ k \y) (4) 

“ k=l - 

In what follows, we shall investigate the conditions under which the 
transformation (4) is canonical. 

Although not essential to what follows, the motivation for this investigation 
may be of some interest. In the classical perturbation theory developed for 
celestial mechanics, the basic problem is the solution of a canonical system 
depending on a small parameter £ , given that in the limit of vanishing £. 
("unperturbed motion") the solution is known and periodic. One procedure used 
•there (usually associated with the names of Von Zeipel and Poincare) is as 
follows . First, one prepares the ground by transforming the original 
problem to new variables — which will be labeled y as in eq. (l) — so that 

one of the new variables is angle -like and represents the periodicity of the 
unperturbed motion, while the others, in the limit of vanishing £. , are constant 
in time. If the angle variable is conjugate to the variable y^ , the new 
variables may be chosen so as to make the hamiltonian H equal y^ as well. 

The same variables are then introduced into the finite- £ problem, in which 
case the Hamiltonian assumes the form 

H - yj. + Et k H (l) ( ? ) (5) 

A near-identity canonical transformation is now sought, generated by 
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<T( P, q) = n P ± + L £.* GT w (p, a) (7) 

i=l k=l 

such that in the new variables z_ , the angle variable is absent 'from the 
transformed Hamiltonian , making its conjugate a constant of the motion. 

Further solution of the motion involves neither of these variables as an unknown, 
so that the problem has been reduced by two variables, or equivalently, by one 
dimension. 

(4) 

A different approach, independently developed by Kruskal, , is related to 
the method of Bogoliubov and Krylov, and describes the evolution of a system 
through 2n equations of the form 

dx/dt = f^(x) (8) 

k=0 

The method does not require the system to be canonical, but we shall 
restrict ourselves here to problems where this is the case. The preparatory 
derivation of "intermediate variables" y is then ver^ much the same as before. 

To solve the perturbed problem, however, the method seeks a direct transformation 
of the form (4) , with the angle variable transformed into one of the components 
of z which will be denoted (for reasons which will be clarified later) as . 

The transformation is such that when the equations (8) are transformed to give 
the evolution of zr 

dz/dt - £ k (9) 

k=0 

then z. is absent oh the right hand side. One can now separately handle the (2n~l) 
equations in (2n-l) variables obtained by omitting the equation for dE^/dt 
from the above set. One can also obtain a constant of the motion, the so-called 
adiabatic invariant J , thus removing one more unknown variable from the problem. 
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The details of J will not be described here beyond noting that it resembles 
in many ways the constant action variable obtained by the preceding method 
— for instance, the two may be shown to be equal at least to order £? 

The two methods are evidently similar, and one may ask whether they can be 
made to coincide, by a suitable choice of the arbitrariness existing in Kruskal*s 
method^. For this to happen, the transformation (4) obtained by Kruskal's 
approach must be canonical, which leads one to the basic problem stated before® 

As will be shown, the matter is also intimately connected to the work of Lacina^]^ 
where it leads to the correction of an error® 

Two approaches to the problem will be described here: the first one is 
concise and is derived from the conventional form of canonical transformations, 
while the second one appears to be more elegant and has interesting geometrical 
implications. It is related to an operator method for characterising direct 

(s) 

canonical transformations, originally devised by Lie and recently applied to 
celestial mechanics by Hori^ and Deprit^*^. 

DIRECT SOLUTION 


If the transformation (4) is canonical, then there must exist a generating 
function o' (p, q) of the form given in (7) that leads to it® Applying (3), one 
obtains 


\ - % 


k=l ” 


P. 

1 


( 10 ) 

( 11 ) 


= P ± “ £ £ k {^)cr k (P, q)/(iq.} 

If the ordering of the components of £ ^ corresponds to the ordering 
of the components of y in (l), one may define "partial vectors", the sum of 
which equals 
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JT 


(k) 


= ( S, 00 0 . 

°1 -*n 


(k) 


(k) 


(k) 


§ w - ( o...o, j; +1 

which allows (4) to be split up into 


.. 0 ) 

■ iS?) 


(12) 


\ - % - Ee k e«(?) 

. + E £ k '*< k) (y) 

le=l 


P. = p„. 

X X 


(13) 

(14) 


Substituting in (ll) 


P. = p. 
1 *x 


- L 6 k ^cr (k) [p + E £. m Jl^Cy), q'] /~bq_. (15) 

k=l L " m=l ' 

We now introduce expansion operators ^ 11 ^) g(k) sucil that for any function 
f of the .canonical variables ( * means ’operates on’) 


(i =s 1 n) 


.(m), 


f(P, q) = JZ £ S 

m=0 


(m) 


f(p» q) 


(16) 


The explicit expressions for may be obtained by replacing £ W with 

^ in equations (13) of reference 5 i they involve V operators in 


(k) 


y space acting on the Jy We then get being unity) 

P = Pi - £ £ k (lcr (k) /'& q. + £ S (m) * 7Kr (k ‘ m) A qA (17) 

k=l t- • m=l J 

where the various orders of O' on the r.h.s. are functions of y only and may 

be obtained from those appearing in eq. (7) by replacing I? everywhere with p . 

This is formally equivalent to (14) and therefore 

J1< k) = - W w <y)/-! 1% - tfsKe ^ 

1 1 m=l 


( 0 ) 


( 18 ) 
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Since the appearing here all involve lower orders of ^ , this 

(k) 

ia a usable recursion f oimula, allowing 7\ for the direct transformation, to 

(k) 

be solved — provided the (j are known and provided the lower orders have 
already been solved* Similarly, one gets 


n (k) 
” i+n 


■»<r W <y)A p. - 13 n 

m=l 


k-1 




.(k-m) 


(19) 


The last two equations can he joined together by introducing the concept 
of the conjugate vector y , formed by permuting the order of components of y 
given in equation (l) to 


y = 


( q* -P ) 


( 20 ) 


Several properties of y are described in reference 5 5 using them, one 
may express Poisson brackets as ( V operator defined in y space) 

£_a, b} = V a * V* ( 2 l) 

and in particular 

[a* y ± l = ^a/^y. ( 22 ) 

With this notation, (l8) and (19) may be combined to 


•S 


Ck) 


■with 




(k) 


va (k) •+ 


k-1 


»(k) 


(23) 


- a sK -aa-^/Jy* (24) 

m=l 


being determined only by lower orders. In spite of its external appearance, this 

(m.) 

relation is not free from partial vectors, since the JP appear in the S 
operators. A criterion for canonicity is now easily -established. If the trans- 
formation (4) is canonical up to and including the order (k-l), one may form the 


u 


(k) 




(k) 


J 


(k) 


vector 
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(k) 

and the transformation will he canonical if and only if u is a gradient in 


y space, that is, if the components of the "conjugate curl" tensor of u 


(k) 


( V x u (k) ). , = 


i>4 k) /^y. - y. 


(25) 


all vanish. 


SOLUTION BY POISSON BRACKETS 


A different approach involves no partial vectors and is "based on the properties 
of Poisson brackets. If both y and _z are canonical, and equal in the zeroth 
order, then 

[y±. y d l = tv z ol (26) 

the value of the brackets being 1 or 0 depending on whether canonical conju- 
gacy exists or not. Substituting the expansion (4) on the r.h.s. and equating 
each order in £ separately, one gets as the condition for canonicity a set of 
relations, which through (22) and (25) can be expressed as 

m=l 

Of more interest than a criterion for canonicity would be derivation of a 

method for actually generating direct canonical transformations. Equations (27.) 

( k ) 

are not very convenient for this, since they involve the components of £ 
only through combinations of their derivatives. A method for integrating these 
equations would obviously be useful here. For .instance, the first equation of the 

3et (1) 


(V X \ M ) 13 


(27) 


the 


0 

Pi V (l) 


V X £ 

may be integrated to / -, \ , , , , 

= VX UJ (28) 

with an arbitrary scalar. Similarly, an arbitrary conjugate gradient may 

(k) 

be added to any of the £ v , since such a gradient is ignored by the curl 
operation. We may thus formally write the most general solution of (27) as 
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% {k) = n w + (29) 

where is any vector depending only on lower orders of £ which gives 

one particular solution of that equation. Comparing (29) with (25) shows that one 

(k) 

possible choice of f v 1 is 

f( k ) = ( 5 0) 

which carries with it the identification 

•X (k) = -<r (fc) 


The drawback of this choice (more esthetical than practical) is that f/ ^ contains 
partial vectors. Solutions which do not split up phase space into coordinates and 
momenta also do exist, to any order. The first of these is almost trivial: 

by (28), f^ vanishes. The next three vectors are 


f (2) = 4- ,VS (1) 

f (4) - + 4 | (2 U| (2) 

+ | (2) - 

- *[(f ) .n<V$ <l) >S (l) 


(51) 


They are far from unique , since various "curl-free" expressions, involving 
only the 5 V > may be added to any one of them. A general method for deriving 
such expressions, based on Lie's method for characterizing direct canonical trans- 
formations, will now be described. 
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LIE’S METHOD 


Let W(y) be an arbitrary function of a given set of canonical variables. 

One then defines^ 0 ^ the Lie derivative generated by W of any function f (y) as 
the function 

V f) = [ f - 1 ] 

The operator L^ is linear 

LyUf + j ygj 

and its action on a product resembles that of the derivative 
L^fg) = f hy( g) + g Ly(f) 

By means of Jacobi's identity one can prove similar properties for L^. acting on 
Poisson brackets, except that the order of terms must now be preserved: 

hi [ f ’ g | = g J + [ f * 

By successive application of L^ , various powers L^ n of the operator 'may be 
defined, and for completeness one then includes as the identity operator. If 

■>{ is a constant, L^^ihus vanishes for all values of n except zero, since 
all powers of L^ except the zeroth involve differentiation. 

Using the preceding definitions, one may define an exponential operator 


exp ( £1^) * f = ( 1 + £.\ + ife 2 I^ 2 + ..) * f 

where £ is a constant much smaller than unity which helps- the expression 
converge (convergence will not be discussed here, however). The property of the 
exponential operator of importance here is 

exp ( & L w ) *[f , g] = [ exp ( £ Iy) * f , exp ( £, L w ) * g j 


( 32 ) 
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The preceding transformation shows that if y is canonical, then for any W 

z ~ exp £,1^ * y (53) 

is also canonical, for we then have 

C*i> = sxp . [y., yj] 

the latter Poisson brackets being always a constant, equal to 1 or 0 . By (32), 
the transformation is a near-identity one, even if the "generating function" W 
does not depend on £ . It is, nevertheless, possible to include such a depen- 
dence , expressed in a power series in £ 

W(y, £} = - Hi £ k W^'(y) (34) 

k=0 

We now prove the following. Let a direct canonical transformation be given 
by Lie’s method as in (33), with W expanded as in (34). Then the same trans- 
formation may also be expressed as in (4), with given as in (29). If 

we choose to identify 

% (k) = -W< k - lJ (35) 

then the two approaches may be readily related and an explicit expression for 
ff k ^ may be found. 

We begin with the following lemma ; suppose that (35) holds, that i 4 k, 

that the*" function" g( ^ ) is an expression involving orders of lower 

(m) ~~ 

than the k-th and that f ' { £ ) is known for values of m smaller than k . 

Then a "function" h( ^ ) , similar in structure to g and may always be 

W* -1 ] = h(|) 


found so that 
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Proof: using (2l), (29) and {35) 

[g, w 1 - 1 ] * [* (i) . ej 

= VK (i Ug(P 

( ) ) *vg($ ) 

and the last expression has the required form* 

Several corrolaries are now easily derived: 

(1) The preceding result is still valid if g is replaced by a vector 

g( £ ) in y space, in which case h is also replaced by a vector h . 

(2) If multiple nested Poisson brackets are given, under the same conditions 
as stated before and of the form 

r 

... [*(§), V 1 " 1 ] , W j_1 , ... W 3 ** 1 - (36) 

they may still be reduced to the form h( § ). To prove this one only has to 
note that the innermost brackets may be thus reduced, then the innermost 
brackets of the remaining expression, and so on until all brackets have 
been eliminated. 

(3) The preceding still holds if g( £ ) in (36) is replaced by y . For the 
innermost brackets then become 

[y, W 1 " 1 ] = y] 

= V X (i) 

= 

which has the form of h ( 5 ) • The remaining brackets may then be removed 


as before. 
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We now prove the main assertion. Let 

4 k) ( f ) - [g f W (k_l) ] 

(note displacement of order index) so that 

^ ■ £* k 4 k) 

With this, notation, the expression (36) may he rewritten 


and as was noted, this can he reduced to the form h( ^ ), as can the analogous 
expression with y replacing g » 

We now have, hy ( 32 ) 


- { 1 + L& k 4 k) - *(& k 4 k) ) 2 + -- l*? 


E 


(38) 


r(k) 


where the M' J are expansion operators resembling those of (l6)» with the 

(k) 

difference that account must be taken of the fact that ; operators 

with different values of k do not commute. One has, for instance (compare 
the last of eqs, ( 13 ) in ref. 5 ) 

m (3) = 4 3) ♦ k 4 2) ♦ 4 2) 4 1 ) * (1/6) (#¥ 

Let us denote 


- L^ 


Then, since 
one obtains from (38) 


4 k) (y) = [ 7 . w tel) J= - vw (Ka) 


= jr - £ £, k VW i *- l) + E t k x ik) *y 

k=l k=l 


( 39 ) 
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Since N^* y consists only of terms of the form (37), it may he reduced to 
a "function” h(^ ). Comparison of the last equation with (4) and (29) then 
identifies this function with f^(£), provided (35) holds. This completes 
the proof of our original assertion. 

LACINA'S EXPANSION 

Lacina^^ has published what he claims is a simple new canonical pertur- 
bation method, leading to results similar to those obtained from the Hamilton- 
Jacobi equation. Unfortunately, the simplicity is more apparent than real, for 
two reasons. First, there exists an important omission in the calculation, and 
secondly, there is no assumption of near-periodicity, so that the elimination 
of secular terms may be dispensed with. Perturbation calculations for systems 
without periodic character are possible, but of little interest, since their 
range of validity in time is usually quite limited. It is the periodic character 
inherent in the problems of celestial mechanics and of guiding center motion 
which makes possible solutions valid over long intervals in time, provided secular 
terms are eliminated e 

Lacina's result is easily derived by the preceding formalism and, in fact, 
our notation allows more concise treatment than is found in the original articles, 
which use a separate notation for canonical coordinates and momenta. Let a near- 
identity canonical transformation be given by a direct relation as in (4) ; the 

. (k) 

transformation is then fully specified by the various orders of yL appearing 
in (29), assuming of course that a paticular choice of ff^ has been selected. 
This choice could be the one of (30), in which case ~)L ^ is the k-th order 
of the conventional generating function CT , with sign reversed, or it may be 
the one derived in (39): any such choice may be used in what follows. 

By using (29), any single component of the direct expansion, e.g. 
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(k) 


+ f[ k) 


(40) 


may be used to define (within a cert a in arbitrariness) and consequently 

the transformation* This is essentially the idea behind Lanina' s approach* To 
define the transformation by means of he U8es the (i, l) component 

of (27) 

- ^5^/7.^ - e rsi m) . 


(41) 


from which 


$i k) - f 1 - j? + /ti k) (y) 


(42) 


where y is the vector formed of the (2n-l) components of y excluding + y^ 
(the sign being adjusted so that this is a component of y ) . 

(k) 

The lower limit is arbitrary, but its choice affects J* ^ ' , which has 
to be chosen in a way assuring that (27) also=‘ J holds when neither x nor j 
equals 1 . To handle such cases, we express the Poisson brackets by means of 
f^, which is known to be a solution for in (27) 

J- 


ii k) 


- J 1 • ''^ k V'jy i + T>4 k)/ ^ 5 i " d5 x 


+ jA [ k \y) (43) 

As one forms the (i,j) ; element of the curl of the vector of which the above 
expression is one component, the contributions of ^ and of in the 
integrand cancel out and one is left with 
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( Vx ^ 


(k)-v 


f 7{)( V X dyj_ + ( V x p (k ))_ 

J c (44) 

(V X f tk) ) - (V x f (k) (y =C) ) + ( ^ x ^l (k) ).. 

-L XJ lj 


Here the l.h.s. cancels with the first term on the right and one is left with 
the condition 

JA = f^^y^c) + VS^ k) (y) (45) 

where ^ ^ k ^ is any function of y (or zero). Because only the curl of ff k ^ 

is involved, any choice satisfying (29) may be used. Hone of this appears in 
(7) 

Lacina’s work , because the contributions of the lower limit of integration 
are inadvertently omitted in the last steps of the equations following 

eq. (15) and (16) there. 

Let us assume that the unperturbed problem has been prepared so that the 
Hamiltonian has the form given in eq. (5). Following Lacina, we now stipulate 
that in the new variables the Hamiltonian equals z^ . Since the transformation 
is time independent, thi3 new Hamiltonian equals the old one, given in (5), 
leading to the identification 


^ k) (y) = H (k) (y) 


(46) 


Substituting this choice of into (42) and evaluating £ ( k ^ by 

means of (45) allows the other to be derived to any desired order. Since 
the new Hamiltonian equals , the valuable conjugate to will be linear 

in time and all other new variables (z^ included) constants of the motion. The 
problem is thus essentially solved. 

If one defines 


( 0 ) 


Si' 


and loses (42) and (22) in (4l), then the basic equation converts to 
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U ^ X 

111=0 

/ \ r V 

This is the starting point of the method of McNamara and Whiteman V ^ 

( 13 ) 

which in turn is related to Whittaker's adelphic integral , Contopoulos 1 
third integral*'' 14 '^ and to other approaches quoted by Contopoulos. The diffe- 
rence is th at McN amar a and Whiteman assume a periodic character of the motion 
nnrt include an extra step to ensure elimination of secular terms. While 

(15) 

their aim is to generate one invariant only, cases may exist in which a 
complete set of invariants can be generated, yielding a solution similar to 
Lacina’s but free from secular! ty<, 
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